Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 17512 |
| Missing cells | 27435 |
| Missing cells (%) | 6.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.6 MiB |
| Average record size in memory | 156.0 B |
Variable types
| Numeric | 18 |
|---|---|
| Categorical | 5 |
grade is highly correlated with bathrooms and 3 other fields | High correlation |
sqft_basement is highly correlated with bathrooms and 4 other fields | High correlation |
bathrooms is highly correlated with grade and 3 other fields | High correlation |
bedrooms is highly correlated with sqft_above and 1 other fields | High correlation |
sqft_above is highly correlated with grade and 5 other fields | High correlation |
sqft_living15 is highly correlated with grade and 3 other fields | High correlation |
floors is highly correlated with yr_built | High correlation |
yr_renovated is highly correlated with jhygtf | High correlation |
yr_built is highly correlated with zipcode and 2 other fields | High correlation |
jhygtf is highly correlated with fue_renovada | High correlation |
sqft_lot is highly correlated with sqft_lot15 | High correlation |
price is highly correlated with sqft_basement | High correlation |
sqft_lot15 is highly correlated with sqft_lot | High correlation |
sqft_living is highly correlated with grade and 5 other fields | High correlation |
fue_renovada is highly correlated with jhygtf | High correlation |
view is highly correlated with waterfront | High correlation |
waterfront is highly correlated with view | High correlation |
zipcode is highly correlated with yr_built | High correlation |
condition is highly correlated with yr_built | High correlation |
sqft_basement has 10655 (60.8%) missing values | Missing |
yr_renovated has 16780 (95.8%) missing values | Missing |
df_index has unique values | Unique |
jhygtf has 16780 (95.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-10-02 00:11:03.949832 |
|---|---|
| Analysis finished | 2022-10-02 00:12:42.648424 |
| Duration | 1 minute and 38.7 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
| Distinct | 17512 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18804.9329 |
| Minimum | 1 |
|---|---|
| Maximum | 113866 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1134.55 |
| Q1 | 6191.5 |
| median | 14492.5 |
| Q3 | 27229.25 |
| 95-th percentile | 51374.8 |
| Maximum | 113866 |
| Range | 113865 |
| Interquartile range (IQR) | 21037.75 |
Descriptive statistics
| Standard deviation | 16170.79161 |
|---|---|
| Coefficient of variation (CV) | 0.8599228563 |
| Kurtosis | 1.638998621 |
| Mean | 18804.9329 |
| Median Absolute Deviation (MAD) | 9638 |
| Skewness | 1.270745837 |
| Sum | 329311985 |
| Variance | 261494501.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19857 | 1 | < 0.1% |
| 21828 | 1 | < 0.1% |
| 27214 | 1 | < 0.1% |
| 3170 | 1 | < 0.1% |
| 1653 | 1 | < 0.1% |
| 14364 | 1 | < 0.1% |
| 548 | 1 | < 0.1% |
| 14745 | 1 | < 0.1% |
| 10503 | 1 | < 0.1% |
| 26758 | 1 | < 0.1% |
| Other values (17502) | 17502 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 12 | 1 | |
| 14 | 1 | |
| 15 | 1 |
| Value | Count | Frequency (%) |
| 113866 | 1 | |
| 111906 | 1 | |
| 109571 | 1 | |
| 108311 | 1 | |
| 99297 | 1 | |
| 98325 | 1 | |
| 98195 | 1 | |
| 98015 | 1 | |
| 94768 | 1 | |
| 94083 | 1 |
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98077.85838 |
| Minimum | 98001 |
|---|---|
| Maximum | 98199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 98001 |
|---|---|
| 5-th percentile | 98004 |
| Q1 | 98033 |
| median | 98065 |
| Q3 | 98117 |
| 95-th percentile | 98177 |
| Maximum | 98199 |
| Range | 198 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 53.39133895 |
|---|---|
| Coefficient of variation (CV) | 0.000544377088 |
| Kurtosis | -0.8484504301 |
| Mean | 98077.85838 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 0.4058979044 |
| Sum | 1717539456 |
| Variance | 2850.635074 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98103 | 492 | 2.8% |
| 98115 | 484 | 2.8% |
| 98038 | 474 | 2.7% |
| 98052 | 474 | 2.7% |
| 98034 | 453 | 2.6% |
| 98042 | 449 | 2.6% |
| 98117 | 449 | 2.6% |
| 98006 | 413 | 2.4% |
| 98118 | 409 | 2.3% |
| 98133 | 402 | 2.3% |
| Other values (60) | 13013 |
| Value | Count | Frequency (%) |
| 98001 | 295 | |
| 98002 | 161 | 0.9% |
| 98003 | 214 | |
| 98004 | 257 | |
| 98005 | 142 | 0.8% |
| 98006 | 413 | |
| 98007 | 117 | 0.7% |
| 98008 | 233 | |
| 98010 | 90 | 0.5% |
| 98011 | 153 | 0.9% |
| Value | Count | Frequency (%) |
| 98199 | 253 | |
| 98198 | 223 | |
| 98188 | 108 | 0.6% |
| 98178 | 213 | |
| 98177 | 203 | |
| 98168 | 218 | |
| 98166 | 204 | |
| 98155 | 372 | |
| 98148 | 51 | 0.3% |
| 98146 | 229 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.654179991 |
| Minimum | 1 |
|---|---|
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 7 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 13 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.170412516 |
|---|---|
| Coefficient of variation (CV) | 0.1529115486 |
| Kurtosis | 1.270700385 |
| Mean | 7.654179991 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7758726345 |
| Sum | 134040 |
| Variance | 1.369865457 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 7298 | |
| 8 | 4945 | |
| 9 | 2122 | 12.1% |
| 6 | 1635 | 9.3% |
| 10 | 890 | 5.1% |
| 11 | 315 | 1.8% |
| 5 | 192 | 1.1% |
| 12 | 76 | 0.4% |
| 4 | 24 | 0.1% |
| 13 | 11 | 0.1% |
| Other values (2) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 24 | 0.1% |
| 5 | 192 | 1.1% |
| 6 | 1635 | 9.3% |
| 7 | 7298 | |
| 8 | 4945 | |
| 9 | 2122 | 12.1% |
| 10 | 890 | 5.1% |
| 11 | 315 | 1.8% |
| Value | Count | Frequency (%) |
| 13 | 11 | 0.1% |
| 12 | 76 | 0.4% |
| 11 | 315 | 1.8% |
| 10 | 890 | 5.1% |
| 9 | 2122 | 12.1% |
| 8 | 4945 | |
| 7 | 7298 | |
| 6 | 1635 | 9.3% |
| 5 | 192 | 1.1% |
| 4 | 24 | 0.1% |
| Distinct | 286 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 10655 |
| Missing (%) | 60.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 745.5457197 |
| Minimum | 10 |
|---|---|
| Maximum | 4820 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 190 |
| Q1 | 450 |
| median | 700 |
| Q3 | 980 |
| 95-th percentile | 1460 |
| Maximum | 4820 |
| Range | 4810 |
| Interquartile range (IQR) | 530 |
Descriptive statistics
| Standard deviation | 407.2904165 |
|---|---|
| Coefficient of variation (CV) | 0.5462983768 |
| Kurtosis | 3.894117059 |
| Mean | 745.5457197 |
| Median Absolute Deviation (MAD) | 260 |
| Skewness | 1.149114174 |
| Sum | 5112207 |
| Variance | 165885.4834 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 600 | 190 | 1.1% |
| 500 | 178 | 1.0% |
| 700 | 172 | 1.0% |
| 800 | 163 | 0.9% |
| 400 | 145 | 0.8% |
| 900 | 122 | 0.7% |
| 1000 | 120 | 0.7% |
| 300 | 108 | 0.6% |
| 480 | 92 | 0.5% |
| 530 | 88 | 0.5% |
| Other values (276) | 5479 | |
| (Missing) | 10655 |
| Value | Count | Frequency (%) |
| 10 | 2 | < 0.1% |
| 20 | 1 | < 0.1% |
| 40 | 4 | < 0.1% |
| 50 | 7 | < 0.1% |
| 60 | 10 | 0.1% |
| 65 | 1 | < 0.1% |
| 70 | 5 | < 0.1% |
| 80 | 14 | |
| 90 | 17 | |
| 100 | 31 |
| Value | Count | Frequency (%) |
| 4820 | 1 | |
| 4130 | 1 | |
| 3500 | 1 | |
| 3480 | 1 | |
| 3260 | 1 | |
| 3000 | 1 | |
| 2810 | 1 | |
| 2730 | 1 | |
| 2720 | 1 | |
| 2620 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 992.0 KiB |
| 0 | |
|---|---|
| 2 | 775 |
| 3 | 419 |
| 1 | 280 |
| 4 | 251 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 17512 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 15787 | |
| 2 | 775 | 4.4% |
| 3 | 419 | 2.4% |
| 1 | 280 | 1.6% |
| 4 | 251 | 1.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 15787 | |
| 2 | 775 | 4.4% |
| 3 | 419 | 2.4% |
| 1 | 280 | 1.6% |
| 4 | 251 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15787 | |
| 2 | 775 | 4.4% |
| 3 | 419 | 2.4% |
| 1 | 280 | 1.6% |
| 4 | 251 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17512 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15787 | |
| 2 | 775 | 4.4% |
| 3 | 419 | 2.4% |
| 1 | 280 | 1.6% |
| 4 | 251 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17512 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15787 | |
| 2 | 775 | 4.4% |
| 3 | 419 | 2.4% |
| 1 | 280 | 1.6% |
| 4 | 251 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17512 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15787 | |
| 2 | 775 | 4.4% |
| 3 | 419 | 2.4% |
| 1 | 280 | 1.6% |
| 4 | 251 | 1.4% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.746745089 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 65 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7324289725 |
|---|---|
| Coefficient of variation (CV) | 0.4193107381 |
| Kurtosis | 2.026467896 |
| Mean | 1.746745089 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.9094449525 |
| Sum | 30589 |
| Variance | 0.5364521998 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 8544 | |
| 1 | 6798 | |
| 3 | 1788 | 10.2% |
| 4 | 264 | 1.5% |
| 0 | 65 | 0.4% |
| 5 | 39 | 0.2% |
| 6 | 12 | 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 65 | 0.4% |
| 1 | 6798 | |
| 2 | 8544 | |
| 3 | 1788 | 10.2% |
| 4 | 264 | 1.5% |
| 5 | 39 | 0.2% |
| 6 | 12 | 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 6 | 12 | 0.1% |
| 5 | 39 | 0.2% |
| 4 | 264 | 1.5% |
| 3 | 1788 | 10.2% |
| 2 | 8544 | |
| 1 | 6798 | |
| 0 | 65 | 0.4% |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.372144815 |
| Minimum | 0 |
|---|---|
| Maximum | 33 |
| Zeros | 11 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 33 |
| Range | 33 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9361956802 |
|---|---|
| Coefficient of variation (CV) | 0.2776261791 |
| Kurtosis | 58.60790948 |
| Mean | 3.372144815 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.269372602 |
| Sum | 59053 |
| Variance | 0.8764623517 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 7928 | |
| 4 | 5599 | |
| 2 | 2236 | 12.8% |
| 5 | 1304 | 7.4% |
| 6 | 217 | 1.2% |
| 1 | 166 | 0.9% |
| 7 | 32 | 0.2% |
| 0 | 11 | 0.1% |
| 8 | 11 | 0.1% |
| 10 | 3 | < 0.1% |
| Other values (3) | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 11 | 0.1% |
| 1 | 166 | 0.9% |
| 2 | 2236 | 12.8% |
| 3 | 7928 | |
| 4 | 5599 | |
| 5 | 1304 | 7.4% |
| 6 | 217 | 1.2% |
| 7 | 32 | 0.2% |
| 8 | 11 | 0.1% |
| 9 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| 9 | 3 | < 0.1% |
| 8 | 11 | 0.1% |
| 7 | 32 | 0.2% |
| 6 | 217 | 1.2% |
| 5 | 1304 | 7.4% |
| 4 | 5599 | |
| 3 | 7928 |
| Distinct | 856 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1789.444838 |
| Minimum | 290 |
|---|---|
| Maximum | 9410 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 290 |
|---|---|
| 5-th percentile | 850 |
| Q1 | 1190 |
| median | 1560 |
| Q3 | 2220 |
| 95-th percentile | 3370 |
| Maximum | 9410 |
| Range | 9120 |
| Interquartile range (IQR) | 1030 |
Descriptive statistics
| Standard deviation | 825.4332172 |
|---|---|
| Coefficient of variation (CV) | 0.4612789396 |
| Kurtosis | 3.405498552 |
| Mean | 1789.444838 |
| Median Absolute Deviation (MAD) | 450 |
| Skewness | 1.437022533 |
| Sum | 31336758 |
| Variance | 681339.996 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1200 | 168 | 1.0% |
| 1300 | 160 | 0.9% |
| 1010 | 159 | 0.9% |
| 1400 | 155 | 0.9% |
| 1340 | 151 | 0.9% |
| 1220 | 149 | 0.9% |
| 1180 | 146 | 0.8% |
| 1140 | 145 | 0.8% |
| 1060 | 145 | 0.8% |
| 1100 | 140 | 0.8% |
| Other values (846) | 15994 |
| Value | Count | Frequency (%) |
| 290 | 1 | < 0.1% |
| 380 | 1 | < 0.1% |
| 384 | 1 | < 0.1% |
| 390 | 1 | < 0.1% |
| 420 | 2 | |
| 430 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 470 | 2 | |
| 480 | 4 | |
| 490 | 2 |
| Value | Count | Frequency (%) |
| 9410 | 1 | |
| 8570 | 1 | |
| 8020 | 1 | |
| 7880 | 1 | |
| 7850 | 1 | |
| 7680 | 1 | |
| 7420 | 1 | |
| 7320 | 1 | |
| 6660 | 1 | |
| 6640 | 1 |
| Distinct | 722 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1985.622316 |
| Minimum | 399 |
|---|---|
| Maximum | 6210 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 399 |
|---|---|
| 5-th percentile | 1140 |
| Q1 | 1490 |
| median | 1840 |
| Q3 | 2370 |
| 95-th percentile | 3290 |
| Maximum | 6210 |
| Range | 5811 |
| Interquartile range (IQR) | 880 |
Descriptive statistics
| Standard deviation | 684.3686073 |
|---|---|
| Coefficient of variation (CV) | 0.3446620245 |
| Kurtosis | 1.619937303 |
| Mean | 1985.622316 |
| Median Absolute Deviation (MAD) | 410 |
| Skewness | 1.10646386 |
| Sum | 34772218 |
| Variance | 468360.3907 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1560 | 162 | 0.9% |
| 1440 | 161 | 0.9% |
| 1540 | 157 | 0.9% |
| 1500 | 152 | 0.9% |
| 1460 | 147 | 0.8% |
| 1580 | 141 | 0.8% |
| 1720 | 141 | 0.8% |
| 1620 | 137 | 0.8% |
| 1480 | 136 | 0.8% |
| 1520 | 135 | 0.8% |
| Other values (712) | 16043 |
| Value | Count | Frequency (%) |
| 399 | 1 | < 0.1% |
| 460 | 1 | < 0.1% |
| 620 | 2 | < 0.1% |
| 670 | 1 | < 0.1% |
| 690 | 2 | < 0.1% |
| 700 | 2 | < 0.1% |
| 710 | 1 | < 0.1% |
| 720 | 2 | < 0.1% |
| 740 | 5 | |
| 750 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 6210 | 1 | < 0.1% |
| 6110 | 1 | < 0.1% |
| 5790 | 5 | |
| 5610 | 1 | < 0.1% |
| 5600 | 1 | < 0.1% |
| 5380 | 1 | < 0.1% |
| 5340 | 1 | < 0.1% |
| 5330 | 1 | < 0.1% |
| 5220 | 1 | < 0.1% |
| 5200 | 1 | < 0.1% |
lat
Real number (ℝ≥0)
| Distinct | 4861 |
|---|---|
| Distinct (%) | 27.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4470.201909 |
| Minimum | 47.1559 |
|---|---|
| Maximum | 47777 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 47.1559 |
|---|---|
| 5-th percentile | 47.313855 |
| Q1 | 47.4852 |
| median | 47.59465 |
| Q3 | 47.6989 |
| 95-th percentile | 47559 |
| Maximum | 47777 |
| Range | 47729.8441 |
| Interquartile range (IQR) | 0.2137 |
Descriptive statistics
| Standard deviation | 13805.58766 |
|---|---|
| Coefficient of variation (CV) | 3.088358857 |
| Kurtosis | 5.848568351 |
| Mean | 4470.201909 |
| Median Absolute Deviation (MAD) | 0.10655 |
| Skewness | 2.801391482 |
| Sum | 78282175.84 |
| Variance | 190594250.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.5518 | 14 | 0.1% |
| 47.6842 | 14 | 0.1% |
| 47.5322 | 14 | 0.1% |
| 47.5445 | 13 | 0.1% |
| 47.5402 | 13 | 0.1% |
| 47.6711 | 13 | 0.1% |
| 47.6916 | 13 | 0.1% |
| 47.6727 | 13 | 0.1% |
| 47.6955 | 13 | 0.1% |
| 47686 | 13 | 0.1% |
| Other values (4851) | 17379 |
| Value | Count | Frequency (%) |
| 47.1559 | 1 | |
| 47.1593 | 1 | |
| 47.1622 | 1 | |
| 47.1647 | 1 | |
| 47.1764 | 1 | |
| 47.1775 | 1 | |
| 47.1776 | 2 | |
| 47.1795 | 1 | |
| 47.1808 | 1 | |
| 47.1853 | 1 |
| Value | Count | Frequency (%) |
| 47777 | 2 | < 0.1% |
| 47776 | 7 | |
| 47775 | 3 | < 0.1% |
| 47774 | 9 | |
| 47773 | 5 | |
| 47772 | 4 | |
| 47771 | 3 | < 0.1% |
| 47769 | 1 | < 0.1% |
| 47768 | 5 | |
| 47767 | 2 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 992.0 KiB |
| 0 | |
|---|---|
| 1 | 134 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 17512 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 17378 | |
| 1 | 134 | 0.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 17378 | |
| 1 | 134 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 17378 | |
| 1 | 134 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17512 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 17378 | |
| 1 | 134 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17512 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 17378 | |
| 1 | 134 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17512 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 17378 | |
| 1 | 134 | 0.8% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.492947693 |
| Minimum | 1 |
|---|---|
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1.5 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 3.5 |
| Range | 2.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5403790699 |
|---|---|
| Coefficient of variation (CV) | 0.3619544559 |
| Kurtosis | -0.4743401164 |
| Mean | 1.492947693 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.6241077506 |
| Sum | 26144.5 |
| Variance | 0.2920095392 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 8681 | |
| 2 | 6639 | |
| 1.5 | 1550 | 8.9% |
| 3 | 497 | 2.8% |
| 2.5 | 138 | 0.8% |
| 3.5 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 8681 | |
| 1.5 | 1550 | 8.9% |
| 2 | 6639 | |
| 2.5 | 138 | 0.8% |
| 3 | 497 | 2.8% |
| 3.5 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.5 | 7 | < 0.1% |
| 3 | 497 | 2.8% |
| 2.5 | 138 | 0.8% |
| 2 | 6639 | |
| 1.5 | 1550 | 8.9% |
| 1 | 8681 |
| Distinct | 69 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 16780 |
| Missing (%) | 95.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1995.711749 |
| Minimum | 1934 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 1934 |
|---|---|
| 5-th percentile | 1963 |
| Q1 | 1987 |
| median | 2000 |
| Q3 | 2008 |
| 95-th percentile | 2014 |
| Maximum | 2015 |
| Range | 81 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 15.91513828 |
|---|---|
| Coefficient of variation (CV) | 0.00797466783 |
| Kurtosis | 0.9341076887 |
| Mean | 1995.711749 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -1.063180943 |
| Sum | 1460861 |
| Variance | 253.2916265 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 76 | 0.4% |
| 2013 | 32 | 0.2% |
| 2003 | 28 | 0.2% |
| 2000 | 28 | 0.2% |
| 2005 | 27 | 0.2% |
| 2007 | 27 | 0.2% |
| 1990 | 23 | 0.1% |
| 2006 | 22 | 0.1% |
| 2004 | 18 | 0.1% |
| 2009 | 18 | 0.1% |
| Other values (59) | 433 | 2.5% |
| (Missing) | 16780 |
| Value | Count | Frequency (%) |
| 1934 | 1 | < 0.1% |
| 1940 | 2 | |
| 1944 | 1 | < 0.1% |
| 1945 | 2 | |
| 1946 | 2 | |
| 1948 | 1 | < 0.1% |
| 1950 | 2 | |
| 1951 | 1 | < 0.1% |
| 1953 | 3 | |
| 1954 | 2 |
| Value | Count | Frequency (%) |
| 2015 | 13 | 0.1% |
| 2014 | 76 | |
| 2013 | 32 | |
| 2012 | 9 | 0.1% |
| 2011 | 8 | < 0.1% |
| 2010 | 14 | 0.1% |
| 2009 | 18 | 0.1% |
| 2008 | 16 | 0.1% |
| 2007 | 27 | 0.2% |
| 2006 | 22 | 0.1% |
| Distinct | 116 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1970.973561 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1915 |
| Q1 | 1951.75 |
| median | 1975 |
| Q3 | 1997 |
| 95-th percentile | 2010 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 45.25 |
Descriptive statistics
| Standard deviation | 29.33339767 |
|---|---|
| Coefficient of variation (CV) | 0.01488269465 |
| Kurtosis | -0.6547921683 |
| Mean | 1970.973561 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -0.4705211326 |
| Sum | 34515689 |
| Variance | 860.4482188 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 441 | 2.5% |
| 2005 | 370 | 2.1% |
| 2006 | 364 | 2.1% |
| 2004 | 357 | 2.0% |
| 2003 | 346 | 2.0% |
| 2007 | 344 | 2.0% |
| 1977 | 333 | 1.9% |
| 1978 | 326 | 1.9% |
| 1968 | 305 | 1.7% |
| 2008 | 295 | 1.7% |
| Other values (106) | 14031 |
| Value | Count | Frequency (%) |
| 1900 | 67 | |
| 1901 | 25 | 0.1% |
| 1902 | 24 | 0.1% |
| 1903 | 36 | |
| 1904 | 37 | |
| 1905 | 56 | |
| 1906 | 74 | |
| 1907 | 61 | |
| 1908 | 69 | |
| 1909 | 70 |
| Value | Count | Frequency (%) |
| 2015 | 31 | 0.2% |
| 2014 | 441 | |
| 2013 | 158 | 0.9% |
| 2012 | 129 | 0.7% |
| 2011 | 108 | 0.6% |
| 2010 | 115 | 0.7% |
| 2009 | 182 | |
| 2008 | 295 | |
| 2007 | 344 | |
| 2006 | 364 |
long
Real number (ℝ)
| Distinct | 733 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -109531.765 |
| Minimum | -122519 |
|---|---|
| Maximum | -121.48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 17512 |
| Negative (%) | 100.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | -122519 |
|---|---|
| 5-th percentile | -122385.45 |
| Q1 | -122318 |
| median | -122203 |
| Q3 | -122059 |
| 95-th percentile | -122.23 |
| Maximum | -121.48 |
| Range | 122397.52 |
| Interquartile range (IQR) | 259 |
Descriptive statistics
| Standard deviation | 37250.64296 |
|---|---|
| Coefficient of variation (CV) | -0.340089863 |
| Kurtosis | 4.744687022 |
| Mean | -109531.765 |
| Median Absolute Deviation (MAD) | 122 |
| Skewness | 2.596922767 |
| Sum | -1918120269 |
| Variance | 1387610401 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.29 | 93 | 0.5% |
| -122.3 | 86 | 0.5% |
| -122362 | 85 | 0.5% |
| -122291 | 81 | 0.5% |
| -122363 | 80 | 0.5% |
| -122.35 | 80 | 0.5% |
| -122304 | 79 | 0.5% |
| -122285 | 79 | 0.5% |
| -122357 | 77 | 0.4% |
| -122351 | 77 | 0.4% |
| Other values (723) | 16695 |
| Value | Count | Frequency (%) |
| -122519 | 1 | < 0.1% |
| -122514 | 1 | < 0.1% |
| -122512 | 1 | < 0.1% |
| -122511 | 2 | |
| -122509 | 1 | < 0.1% |
| -122507 | 1 | < 0.1% |
| -122506 | 1 | < 0.1% |
| -122505 | 3 | |
| -122504 | 2 | |
| -122503 | 2 |
| Value | Count | Frequency (%) |
| -121.48 | 1 | < 0.1% |
| -121.73 | 2 | < 0.1% |
| -121.75 | 1 | < 0.1% |
| -121.76 | 1 | < 0.1% |
| -121.77 | 8 | |
| -121.78 | 4 | |
| -121.8 | 1 | < 0.1% |
| -121.81 | 1 | < 0.1% |
| -121.82 | 1 | < 0.1% |
| -121.84 | 1 | < 0.1% |
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83.42056875 |
| Minimum | 0 |
|---|---|
| Maximum | 2015 |
| Zeros | 16780 |
| Zeros (%) | 95.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2015 |
| Range | 2015 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 399.4297204 |
|---|---|
| Coefficient of variation (CV) | 4.788144295 |
| Kurtosis | 18.97903169 |
| Mean | 83.42056875 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.579873065 |
| Sum | 1460861 |
| Variance | 159544.1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 2014 | 76 | 0.4% |
| 2013 | 32 | 0.2% |
| 2003 | 28 | 0.2% |
| 2000 | 28 | 0.2% |
| 2005 | 27 | 0.2% |
| 2007 | 27 | 0.2% |
| 1990 | 23 | 0.1% |
| 2006 | 22 | 0.1% |
| 2004 | 18 | 0.1% |
| Other values (60) | 451 | 2.6% |
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 1934 | 1 | < 0.1% |
| 1940 | 2 | < 0.1% |
| 1944 | 1 | < 0.1% |
| 1945 | 2 | < 0.1% |
| 1946 | 2 | < 0.1% |
| 1948 | 1 | < 0.1% |
| 1950 | 2 | < 0.1% |
| 1951 | 1 | < 0.1% |
| 1953 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 2015 | 13 | 0.1% |
| 2014 | 76 | |
| 2013 | 32 | |
| 2012 | 9 | 0.1% |
| 2011 | 8 | < 0.1% |
| 2010 | 14 | 0.1% |
| 2009 | 18 | 0.1% |
| 2008 | 16 | 0.1% |
| 2007 | 27 | 0.2% |
| 2006 | 22 | 0.1% |
| Distinct | 8441 |
|---|---|
| Distinct (%) | 48.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14943.17988 |
| Minimum | 520 |
|---|---|
| Maximum | 1651359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 520 |
|---|---|
| 5-th percentile | 1821.55 |
| Q1 | 5026 |
| median | 7620 |
| Q3 | 10711.5 |
| 95-th percentile | 42782.95 |
| Maximum | 1651359 |
| Range | 1650839 |
| Interquartile range (IQR) | 5685.5 |
Descriptive statistics
| Standard deviation | 41280.84382 |
|---|---|
| Coefficient of variation (CV) | 2.762520706 |
| Kurtosis | 322.4926422 |
| Mean | 14943.17988 |
| Median Absolute Deviation (MAD) | 2639 |
| Skewness | 13.90258241 |
| Sum | 261684966 |
| Variance | 1704108067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 297 | 1.7% |
| 6000 | 229 | 1.3% |
| 4000 | 207 | 1.2% |
| 7200 | 177 | 1.0% |
| 7500 | 100 | 0.6% |
| 4800 | 99 | 0.6% |
| 4500 | 96 | 0.5% |
| 8400 | 92 | 0.5% |
| 9600 | 92 | 0.5% |
| 3600 | 86 | 0.5% |
| Other values (8431) | 16037 |
| Value | Count | Frequency (%) |
| 520 | 1 | |
| 600 | 1 | |
| 609 | 1 | |
| 635 | 1 | |
| 638 | 1 | |
| 649 | 2 | |
| 651 | 1 | |
| 676 | 1 | |
| 681 | 1 | |
| 683 | 1 |
| Value | Count | Frequency (%) |
| 1651359 | 1 | |
| 1164794 | 1 | |
| 1074218 | 1 | |
| 1024068 | 1 | |
| 982998 | 1 | |
| 982278 | 1 | |
| 881654 | 1 | |
| 871200 | 1 | |
| 843309 | 1 | |
| 715690 | 1 |
| Distinct | 3525 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40341863.04 |
| Minimum | 75000 |
|---|---|
| Maximum | 4668000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 75000 |
|---|---|
| 5-th percentile | 210000 |
| Q1 | 320900 |
| median | 450000 |
| Q3 | 645000 |
| 95-th percentile | 1277222.5 |
| Maximum | 4668000000 |
| Range | 4667925000 |
| Interquartile range (IQR) | 324100 |
Descriptive statistics
| Standard deviation | 253858961.5 |
|---|---|
| Coefficient of variation (CV) | 6.292693057 |
| Kurtosis | 64.70307161 |
| Mean | 40341863.04 |
| Median Absolute Deviation (MAD) | 150000 |
| Skewness | 7.368680931 |
| Sum | 7.064667055 × 1011 |
| Variance | 6.444437231 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 350000 | 143 | 0.8% |
| 450000 | 138 | 0.8% |
| 425000 | 132 | 0.8% |
| 550000 | 131 | 0.7% |
| 500000 | 128 | 0.7% |
| 325000 | 118 | 0.7% |
| 375000 | 117 | 0.7% |
| 400000 | 113 | 0.6% |
| 300000 | 111 | 0.6% |
| 250000 | 107 | 0.6% |
| Other values (3515) | 16274 |
| Value | Count | Frequency (%) |
| 75000 | 1 | |
| 78000 | 1 | |
| 80000 | 1 | |
| 81000 | 1 | |
| 82000 | 1 | |
| 82500 | 1 | |
| 83000 | 1 | |
| 84000 | 1 | |
| 85000 | 2 | |
| 89000 | 1 |
| Value | Count | Frequency (%) |
| 4668000000 | 1 | |
| 4489000000 | 1 | |
| 4208000000 | 1 | |
| 3635000000 | 1 | |
| 3567000000 | 1 | |
| 3395000000 | 1 | |
| 3345000000 | 1 | |
| 3278000000 | 1 | |
| 3204000000 | 1 | |
| 3075000000 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 992.0 KiB |
| 3 | |
|---|---|
| 4 | |
| 5 | |
| 2 | 139 |
| 1 | 25 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 17512 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 11375 | |
| 4 | 4600 | |
| 5 | 1373 | 7.8% |
| 2 | 139 | 0.8% |
| 1 | 25 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 3 | 11375 | |
| 4 | 4600 | |
| 5 | 1373 | 7.8% |
| 2 | 139 | 0.8% |
| 1 | 25 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 11375 | |
| 4 | 4600 | |
| 5 | 1373 | 7.8% |
| 2 | 139 | 0.8% |
| 1 | 25 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17512 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 11375 | |
| 4 | 4600 | |
| 5 | 1373 | 7.8% |
| 2 | 139 | 0.8% |
| 1 | 25 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17512 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 11375 | |
| 4 | 4600 | |
| 5 | 1373 | 7.8% |
| 2 | 139 | 0.8% |
| 1 | 25 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17512 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 11375 | |
| 4 | 4600 | |
| 5 | 1373 | 7.8% |
| 2 | 139 | 0.8% |
| 1 | 25 | 0.1% |
| Distinct | 7553 |
|---|---|
| Distinct (%) | 43.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12599.49577 |
| Minimum | 659 |
|---|---|
| Maximum | 871200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 659 |
|---|---|
| 5-th percentile | 2037.65 |
| Q1 | 5100 |
| median | 7626 |
| Q3 | 10084.5 |
| 95-th percentile | 36612.25 |
| Maximum | 871200 |
| Range | 870541 |
| Interquartile range (IQR) | 4984.5 |
Descriptive statistics
| Standard deviation | 26430.82805 |
|---|---|
| Coefficient of variation (CV) | 2.097768714 |
| Kurtosis | 137.4178008 |
| Mean | 12599.49577 |
| Median Absolute Deviation (MAD) | 2514 |
| Skewness | 9.244205444 |
| Sum | 220642370 |
| Variance | 698588671.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 352 | 2.0% |
| 4000 | 296 | 1.7% |
| 6000 | 238 | 1.4% |
| 7200 | 176 | 1.0% |
| 4800 | 115 | 0.7% |
| 7500 | 113 | 0.6% |
| 4500 | 97 | 0.6% |
| 3600 | 93 | 0.5% |
| 8400 | 91 | 0.5% |
| 4080 | 86 | 0.5% |
| Other values (7543) | 15855 |
| Value | Count | Frequency (%) |
| 659 | 1 | < 0.1% |
| 660 | 1 | < 0.1% |
| 748 | 1 | < 0.1% |
| 750 | 3 | |
| 755 | 1 | < 0.1% |
| 758 | 1 | < 0.1% |
| 794 | 1 | < 0.1% |
| 810 | 2 | |
| 886 | 3 | |
| 887 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 871200 | 1 | |
| 560617 | 1 | |
| 438213 | 1 | |
| 434728 | 1 | |
| 425581 | 1 | |
| 422967 | 1 | |
| 392040 | 2 | |
| 386812 | 1 | |
| 380279 | 1 | |
| 360000 | 1 |
| Distinct | 939 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2081.370774 |
| Minimum | 290 |
|---|---|
| Maximum | 13540 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 136.9 KiB |
Quantile statistics
| Minimum | 290 |
|---|---|
| 5-th percentile | 935.5 |
| Q1 | 1430 |
| median | 1920 |
| Q3 | 2550 |
| 95-th percentile | 3760 |
| Maximum | 13540 |
| Range | 13250 |
| Interquartile range (IQR) | 1120 |
Descriptive statistics
| Standard deviation | 918.9428382 |
|---|---|
| Coefficient of variation (CV) | 0.4415084758 |
| Kurtosis | 5.645995735 |
| Mean | 2081.370774 |
| Median Absolute Deviation (MAD) | 550 |
| Skewness | 1.499112814 |
| Sum | 36448965 |
| Variance | 844455.9399 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1440 | 113 | 0.6% |
| 1400 | 110 | 0.6% |
| 1300 | 108 | 0.6% |
| 1480 | 104 | 0.6% |
| 1540 | 103 | 0.6% |
| 1560 | 101 | 0.6% |
| 1820 | 100 | 0.6% |
| 1720 | 100 | 0.6% |
| 1010 | 100 | 0.6% |
| 1660 | 100 | 0.6% |
| Other values (929) | 16473 |
| Value | Count | Frequency (%) |
| 290 | 1 | |
| 380 | 1 | |
| 384 | 1 | |
| 390 | 1 | |
| 420 | 2 | |
| 430 | 1 | |
| 440 | 1 | |
| 470 | 2 | |
| 480 | 2 | |
| 490 | 1 |
| Value | Count | Frequency (%) |
| 13540 | 1 | |
| 12050 | 1 | |
| 10040 | 1 | |
| 9640 | 1 | |
| 9200 | 1 | |
| 8670 | 1 | |
| 8020 | 1 | |
| 8010 | 1 | |
| 7880 | 1 | |
| 7850 | 1 |
tiene_sotano
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 992.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 17512 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 10655 | |
| 1 | 6857 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 10655 | |
| 1 | 6857 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10655 | |
| 1 | 6857 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17512 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10655 | |
| 1 | 6857 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17512 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10655 | |
| 1 | 6857 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17512 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10655 | |
| 1 | 6857 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 992.0 KiB |
| 0 | |
|---|---|
| 1 | 732 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 17512 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 1 | 732 | 4.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 1 | 732 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 1 | 732 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17512 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 1 | 732 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17512 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 1 | 732 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17512 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 16780 | |
| 1 | 732 | 4.2% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | zipcode | grade | sqft_basement | view | bathrooms | bedrooms | sqft_above | sqft_living15 | lat | waterfront | floors | yr_renovated | yr_built | long | jhygtf | sqft_lot | price | condition | sqft_lot15 | sqft_living | tiene_sotano | fue_renovada | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 19857 | 98006 | 10 | NaN | 0 | 2.0 | 3.0 | 2610.0 | 3140.0 | 47.5535 | 0 | 2.0 | NaN | 1993.0 | -122115.00 | 0.0 | 8481.0 | 810000.0 | 3 | 10008.0 | 2610.0 | 0 | 0 |
| 1 | 14014 | 98033 | 8 | 650.0 | 1 | 1.0 | 3.0 | 1560.0 | 2210.0 | 47.6621 | 0 | 1.0 | NaN | 1974.0 | -122189.00 | 0.0 | 8955.0 | 685000.0 | 3 | 8976.0 | 2210.0 | 1 | 0 |
| 2 | 32909 | 98005 | 8 | NaN | 0 | 2.0 | 4.0 | 2650.0 | 2230.0 | 47.6075 | 0 | 2.0 | NaN | 1986.0 | -122154.00 | 0.0 | 18295.0 | 725000.0 | 3 | 19856.0 | 2650.0 | 0 | 0 |
| 3 | 16305 | 98001 | 7 | 900.0 | 0 | 1.0 | 5.0 | 1050.0 | 1660.0 | 47.3381 | 0 | 1.0 | NaN | 1962.0 | -122289.00 | 0.0 | 8720.0 | 274000.0 | 3 | 8030.0 | 1950.0 | 1 | 0 |
| 4 | 6647 | 98011 | 7 | 320.0 | 0 | 2.0 | 3.0 | 1310.0 | 1620.0 | 47.7275 | 0 | 1.0 | NaN | 1986.0 | -122232.00 | 0.0 | 6449.0 | 445000.0 | 3 | 7429.0 | 1630.0 | 1 | 0 |
| 5 | 5865 | 98040 | 8 | 850.0 | 0 | 2.0 | 4.0 | 1760.0 | 2550.0 | 47.5875 | 0 | 1.0 | NaN | 1978.0 | -122229.00 | 0.0 | 8760.0 | 762500.0 | 4 | 10376.0 | 2610.0 | 1 | 0 |
| 6 | 8009 | 98004 | 8 | NaN | 1 | 1.0 | 3.0 | 1700.0 | 2630.0 | 47.6166 | 0 | 1.0 | NaN | 1954.0 | -122.22 | 0.0 | 14133.0 | 979000.0 | 4 | 17376.0 | 1700.0 | 0 | 0 |
| 7 | 4731 | 98011 | 8 | 780.0 | 0 | 3.0 | 5.0 | 2090.0 | 2640.0 | 47.7449 | 0 | 2.0 | NaN | 2007.0 | -122192.00 | 0.0 | 4369.0 | 540000.0 | 3 | 4610.0 | 2870.0 | 1 | 0 |
| 8 | 38480 | 98052 | 9 | NaN | 0 | 2.0 | 4.0 | 2700.0 | 2730.0 | 47.7041 | 0 | 2.0 | NaN | 2004.0 | -122116.00 | 0.0 | 8810.0 | 690000.0 | 3 | 5100.0 | 2700.0 | 0 | 0 |
| 9 | 13246 | 98072 | 7 | 530.0 | 0 | 1.0 | 3.0 | 1130.0 | 1260.0 | 47.7628 | 0 | 1.0 | NaN | 1976.0 | -122162.00 | 0.0 | 9673.0 | 375000.0 | 3 | 9681.0 | 1660.0 | 1 | 0 |
Last rows
| df_index | zipcode | grade | sqft_basement | view | bathrooms | bedrooms | sqft_above | sqft_living15 | lat | waterfront | floors | yr_renovated | yr_built | long | jhygtf | sqft_lot | price | condition | sqft_lot15 | sqft_living | tiene_sotano | fue_renovada | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17502 | 49558 | 98198 | 6 | NaN | 2 | 1.0 | 2.0 | 1170.0 | 1380.0 | 47.4017 | 0 | 1.0 | NaN | 1911.0 | -122321.00 | 0.0 | 8925.0 | 1.750000e+05 | 3 | 7440.0 | 1170.0 | 0 | 0 |
| 17503 | 146 | 98117 | 6 | 120.0 | 0 | 1.0 | 2.0 | 860.0 | 980.0 | 47.6769 | 0 | 1.0 | NaN | 1918.0 | -122366.00 | 0.0 | 2130.0 | 4.000000e+05 | 4 | 2800.0 | 980.0 | 1 | 0 |
| 17504 | 62153 | 98024 | 12 | NaN | 0 | 4.0 | 5.0 | 6070.0 | 4680.0 | 47.5954 | 0 | 2.0 | NaN | 1999.0 | -121.95 | 0.0 | 171626.0 | 1.550000e+06 | 3 | 211267.0 | 6070.0 | 0 | 0 |
| 17505 | 39142 | 98034 | 10 | NaN | 2 | 2.0 | 3.0 | 2510.0 | 2560.0 | 47.7051 | 0 | 2.0 | NaN | 2006.0 | -122223.00 | 0.0 | 4600.0 | 1.185000e+09 | 3 | 7500.0 | 2510.0 | 0 | 0 |
| 17506 | 9396 | 98065 | 7 | NaN | 0 | 2.0 | 3.0 | 1950.0 | 2190.0 | 47.5194 | 0 | 2.0 | NaN | 2007.0 | -121869.00 | 0.0 | 7263.0 | 4.090000e+05 | 3 | 5900.0 | 1950.0 | 0 | 0 |
| 17507 | 14466 | 98198 | 7 | NaN | 0 | 2.0 | 4.0 | 1780.0 | 1630.0 | 47.3828 | 0 | 2.0 | NaN | 1991.0 | -122302.00 | 0.0 | 6000.0 | 1.750000e+05 | 3 | 6000.0 | 1780.0 | 0 | 0 |
| 17508 | 30056 | 98042 | 6 | NaN | 0 | 1.0 | 3.0 | 840.0 | 920.0 | 47.3607 | 0 | 1.0 | NaN | 1969.0 | -122085.00 | 0.0 | 5525.0 | 1.910000e+05 | 5 | 5330.0 | 840.0 | 0 | 0 |
| 17509 | 5824 | 98106 | 7 | 550.0 | 0 | 2.0 | 3.0 | 1230.0 | 1780.0 | 47.5237 | 0 | 1.0 | NaN | 1990.0 | -122353.00 | 0.0 | 6771.0 | 3.100000e+05 | 3 | 6771.0 | 1780.0 | 1 | 0 |
| 17510 | 16712 | 98038 | 7 | NaN | 0 | 2.0 | 3.0 | 1340.0 | 1060.0 | 47.3839 | 0 | 2.0 | NaN | 1995.0 | -122038.00 | 0.0 | 3011.0 | 2.300000e+05 | 3 | 3232.0 | 1340.0 | 0 | 0 |
| 17511 | 237 | 98075 | 10 | NaN | 0 | 2.0 | 3.0 | 3240.0 | 2970.0 | 47.5857 | 0 | 2.0 | NaN | 1994.0 | -122038.00 | 0.0 | 7857.0 | 8.000000e+05 | 3 | 7857.0 | 3240.0 | 0 | 0 |